International Journal of Special Education 
2004, Vol 19, No.2. 


ATTENTION DEFICIT HYPERACTIVITY DISORDER AND RATING SCALES WITH 
A BRIEF REVIEW OF THE CONNORS TEACHER RATING SCALE (1998) 


Matthew Cordes 

and 

T. F. McLaughlin 

Gonzaga University 


This paper explores the Diagnostic and Statistical Manual of Mental Disorders 
(DSM-IV; American Psychiatric Association, 1994) definition of Attention Deficit 
Hyperactivity Disorder) (ADHD). The use of rating scales to diagnose ADHD was 
evaluated. Rating scales have been used since the 1970’s and are highly influential 
in the detection of ADHD today. We also examined the advantages and 
disadvantages of using rating scales. Rating scales seem to change the percentage of 
individuals who meet the criteria for a diagnosis of ADHD based on different age 
groups, gender, and ethnicity. Finally, the Conners Teacher Rating Scale (CTRS-R) 
(Conners, Sitarenios, Parker, & Epstein, 1998) was reviewed. The important 
changes in the revision of the Conners Teacher Rating Scale-Revised ( CTRS-R) were 
detailed. The use of the CTRS-R is recommended as part of a multi-faceted 
assessment to diagnose and evaluate treatment procedures for children and 
adolescents with ADHD. 

The original Conners’ Teacher Rating Scale (1968)was a 39-item rating scale used to determine 
the teacher’s view of a child’s behavior in the classroom (Conners, 1969). This scale was recently 
revised (CTRS-R) in 1998 (Conners, Sitarenios, Parker, & Epstein, 1998) and resulted in a 28- 
item scale. The revision of the original Conners’ Teacher Rating Scale was shortened to 
accommodate the administrator of the test so that the information would be more easily gathered. 
It has also changed from a 3 to 4 point Likert scale. This scale ranges from 0, not at all true, to 3, 
very true (Conners et ah, 1998). 18 of the items on the CTRS-R were written to mirror closely the 
DSM-IV criteria for ADHD. The rest of the items came from various tests such as the previous 
Conners’ which have been shown to have clinical utility (Brown, 1985; Satin, Winsberg, Monetti, 
Sverd, & Foss, 1985; Stein & O'Donnell, 1985) and reliability (Conners, 1969; Edelbrock, 
Greenbaum, & Conover, 1985; Epstein & Nieminen, 1983; Roberts, Milich, Eoney, & Caputo, 
1981; Zentall & Barack, 1979). There are also shortened versions of the original CTRS such as 
the IOWA Conners (Pelham, Milich, Murphy, & Murphy, 1989) and Abbreviated Symptom 
Questionnaire (Sprague & Sleator, 1973). These shorter versions are also very popular with 
teacher because of the brevity of time needed to complete the scale. 

In recent years the one of the most common reasons for referral to mental health practitioners in 
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the United States has been the parents’ concern regarding their child and Attention Deficit 
/Hyperactivity (ADHD). ADHD is one of the most prevalent childhood psychiatric disorders 
(Barkley, 1998; Hundhammer & McLaughlin, 2002). Around 19% of the American school 
children have some type of behavior problem. Half of these children have either a hyperactivity or 
inattention problem (Smucker, 2001). Thus ADHD has a high incidence of occurring with other 
problems (National Institute of Health, 2001). According to the Surgeon General's report, 3-5% of 
American school children have ADHD (American Psychiatric Association, 2000). Earlier 
estimates have said that 1-20% of the population has ADHD (Barkley, 1998). This disorder is 
diagnosed in boys more then girls in ratios ranging from 4:1 to 9:1. This difference is based on 
where sample is taken, whether clinical or community based (Reid, 2001). A formal definition of 
ADHD follows: 

ADHD refers to a family of related chronic neurobiological disorders that interfere with 
an individual's capacity to regulate activity level (hyperactivity), inhibit behavior 
(impulsivity), and attend to tasks (inattention) in developmentally appropriate ways 
(National Institute of Health, 2003). 

The three main characteristics of ADHD are hyperactivity, impulsivity, and inattention. Children 
with ADHD have shown to score an average of 7 to 15 points below the control groups on 
standardized intelligence tests (Laraone et ah, 1993; Lischer, Barkley, Lletcher, & Smallish, 1990; 
McGee Williams, Moffitt, & Anderson, 1989; Prior, Leonard, &Wood, 1983; Tarver-Behring, 
Barkley & Karlsson, 1985; Werry, Elkind, & Reeves, 1987). The reason for this difference is not 
clear, and it has been speculated that this may be the result of the test taking behaviors found in 
children with ADHD. The inattentive response of children with ADHD could lead to answers 
which were given in haste and therefore wrong (Barkley, 1998). Some studies have shown that the 
relationship between low test scores and ADHD comes from learning disabilities (LD) found in 
the children with ADHD (Bohline, 1985). However in a study carried out by Barkley et ah, found 
that children with ADHD/LD had higher test scores then children diagnosed with only LD 
(Barkley, 1990). 

Children with ADHD have shown intelligence quotients from mildly mentally intellectually 
retarded to gifted. Besides scoring lower on intelligence testing, children with ADHD have shown 
some difficulty in the classroom. Children with ADHD normally have trouble in two areas: 
academic performance and achievement. Most children referred to clinics are performing well 
below their known levels of abilities (Barkley, 1998). Evidence supports that medication will help 
some school children with academic productivity and accuracy (Barkley, 1977; Pelham, Bender, 
Caddell, Booth, & Moorer, 1985; Rapport, DuPaul, Stoner, & Jones, 1986). Even with 
medication, students are still showing below average performances on standardized tests 
compared to their peers by as much as 10 to 30 points in a multitude of various subject areas; 
reading, spelling, math, and reading comprehension (Barkley, DuPaul, & McMurray, 1990; Brook 
& Knapp, 1996; Cantwell & Satterfield, 1978; Casey, Rourke, & Del Dotto, 1996; Dykman & 
Ackerman, 1992; Fischer et ah, 1990; Semrud-Clikeman et al., 1992). 

56% of children with ADHD require tutoring, 30% are forced to repeat a grade, and 30-40% must 
be placed in special education program (Barkley, 1998). As many as 46% of ADHD children may 
be suspended from school and 10 to 35% may drop out of high school (Barkley, Fischer et al., 
1990; Barkley DuPaul & McMurray, 1990; Brown & Borden, 1986; Faraone et al., 1993; Munir, 
Biedermen, & Knee, 1987; Stewart et al., 1966; Szatmari, Offord, & Boyle, 1989; Weiss & 
Hechtmen, 1993). Some children with ADHD show social deficits. As adolescents they are more 
likely for motor vehicle accidents, tobacco use and teenage pregnancy (National Institute of 
Health, 2003). ADHD is a serious disorder and a reliable way to diagnose needs to be developed. 


24 



INTERNATIONAL JOURNAL OL SPECIAL EDUCATION 


Vol 19, No.2. 


DSM-IV and ADHD 

The most commonly used diagnostic criteria used in the United State are the Diagnostic and 
Statistical Manual of Mental Disorders (DSM-IV; American Psychiatric Association, 1994) and 
the International Classification of Diseases. This test is very similar to the DSM-IV and is used 
internationally (ICD-10; World Health Organization, 1994). According to Barkley, (1998) the 
DSM-IV are some of the most rigorous and empirically derived criteria ever created in the history 
of clinical diagnosis for this disorder. This version of the DSM was based on items used in rating 
scales that have a high intercorrelation to each other and are valid in distinguishing ADHD from 
other groups of children (Spitzer et ah, 1990). The DSM-IV criterion has different symptoms for 
inattention and hyperactivity. Dividing the criteria into these categories allows diagnoses of 
ADHD and ADD in one set of criteria (Barkley, 1998). All of the symptoms must be present for 
at least 6 months and must reach at least six of the nine criteria in the category (American 
Psychiatric Association, 1994). The other conditions which must be present for diagnosis of 
ADHD is that must appear before age seven, must be present in two or more settings, there must 
be clear evidence of clinically significant impairment in social, academic, or occupational 
function, and symptoms do not occur during the course of another psychotic disorder, and lastly 
are not better accounted for by another mental disorder (American Psychiatric Association, 1994). 

One of the problems with the DSM-IV is its treatment of ADHD- predominantly inattentive (PI). 
The main difference between ADHD-PI and ADHD-combined (C) is that ADHD-PTs main 
problems come with focused/selective attention and sluggish information processing, whereas the 
ADHD-C focuses on the problem of persistent effort and distractibility (Barkley, 1998). This 
would mean that ADD and ADHD will be classified as different disorders and the rating scales 
accompany each will have to change. However, there has not been enough research on this topic 
to make a definite separation of the two (Barkley, 1998). 

A second important concern is the appropriateness of the items for different developmental 
periods (Barkley, 1998). This was not the case for inattention. Harts’ recent research showed that 
inattention remained stable from middle childhood to adolescence (Hart et ah, 1995). Thereafter, 
hyperactivity has been shown to decline as ADHD children reach adolescence (Barkley, 1998). 
Although the hyperactivity level of those with ADHD statistically declines as they get older, 
Barkley’s findings show that many ADHD children still show signs of hyperactivity even though 
they don’t qualify for the diagnosis. Therefore this apparent decline in the percent of those 
diagnosed is not in correlation with the decline in the severity of symptoms used in the criteria 
(Barkley, Lischer, Edelbrock, & Smallish, 1990; Lischer, Barkley, Edelbrock, & Smallish, 1993). 
In Lischer’s study, little difference was found in the measure of the impairment between those no 
longer meeting diagnostic criteria for ADHD and those still doing so (Pisher et ah, 1990). 

All reliable ratings in the United States are based on the DSM-IV. However this is not the case 
with other nations. In Germany researchers using the DSM-IV found that 17.8% of the children 
had ADHD (Baumgaertel et ah, 1995). This is not just the case with Germany, many countries 
have an inflated rate of ADHD using the DSM-IV criteria (Rohde, 2002). This is a very important 
statistic when over 1/3 of the American school population is from another culture, with Hispanics 
being the largest (Reid, 1995). This research is something that rating scales need to consider 
when trying to meet the criteria for DSM-IV. 

Rating Scales 

The use of rating scales has always been one of the major ways of diagnosing a person with 
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ADHD (Barkely 1998; DuPaul & Stoner, 1994). Rating scales have become one of the 
essential elements in the evaluation and diagnosis of children with behavior problems 
(Barkley, 1998). 

Clinicians and researchers consider the use of reliable and valid teacher-completed rating 
scales as standard practice for the diagnosis and treatment of attention deficit/hyperactivity 
disorder (ADHD) (American Academy of Child and Adolescent Psychiatry, 1997; 
American Academy of Pediatrics, 2000), (Mattison, Gadow, Sprafkin, Nolan, Schneider, 
2003). 

The rating scale first began to affect the diagnosis of ADHD with the Conners’ Teacher Rating 
Scale (CTRS) in 1969 (Conners, 1969,1 973). (Gumpel, Wilson, Shalev, 1998). Prom the early 
1970’s on, there has been a professional preference for the rating scale (Gumpel, Wilson, Shalev, 
1998). Before the last decade it has been accepted that the expression, course, and outcome of 
psychological disorders were largely universal and independent of each other (Marsella & 
Kameoka, 1989). This is another reason that rating scales were so heavily depended on without 
question (Reid, Casat, Norton, Anastopoulos, & Temple, 2001). 

Parental ratings have also been shown to be reliable and full of good information (Glascoe & 
Dworkin, 1995). Glascoe & Dworkin state that the accuracy of information from parent ratings 
helps clinical diagnosis of ADHD. Recent studies have shown that correctly interpreted parental 
concern expressed on rating scales can be just as accurate as developmental-behavioral screening 
tests when trying to detect children with disabilities (e.g.. Diamond, 1987; Glascoe, 1997a,1997b, 
2000; Glascoe, Altemeier, & MacLean, 1989; Mulhem, Dworkin, &Bemstein, 1994; Thompson 
& Thompson, 1991; Young, Davis, Schoen, & Parker, 1998). In another study by Glascoe and 
colleagues, it was found that using the Parents' Evaluation of Developmental Status (PEDS) 
expresses a high sensitivity with certain parents concerns (e.g., motor, language, global cognitive, 
academic) when identifying disabilities with children four years and older (Glascoe, 1991, 
1994,1997a, 1997b, 2000; Glascoe et ah, 1989; Glascoe & MacLean, 1990; Glascoe, MacLean, & 
Stone, 1991). The opposite was found to be true, the rating scale shows that parents of children 
without disabilities do not have the previously expressed concerns (Dewey, Crawford, & Kaplan, 
2003). Obviously, rating scales are a useful way of presenting questions to parents to receive 
information. In a study done by Mash and Terdal, it was reported that parents are an important 
source of information. They are familiar with the child’s history and current situation, and have 
the cognitive skills that enable them to recognize and describe their child’s psychopathology and 
learning difficulties (Mash & Terdal, 1997). Parents can identify the strengths and weaknesses of 
the child that may not be assessed by standard psychometric testing (Dewey et al. 2000). 

Teacher rating scales are also an important part of the evaluation and diagnosis of ADHD (Miller, 
Koplewicz, & Klein,1997). Teacher rating scales provide necessary information about the child in 
the school setting. The teacher also becomes a secondary informant who can judge the behavior of 
the child in the context of his peers (Matson, 1993). Teacher rating scales on average are more 
reliable then parent rating scales and tend to be more sensitive to hyperactive behaviors (Barkley, 
1981). All ratings scales have some problem with reliability and validity; however, when 
combining multiple aspects of evaluation, ratings scales are proven to be a useful tool in 
establishing the presence of symptoms, their onset and duration, pervasiveness, and their 
statistical deviance compared to a normal child of the same age (Barkley, 1981). 

Rating scales are a valuable tool in the assessment of ADHD; however, there are many factors that 
can affect the reliability and quality of a rating scale (Parker, 1992). Many of the most 
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commonly used rating scales have similar factors that make them effective. Lor example, they 
offer a quick and easy format, standardized sets of behaviors to insure that specific behaviors are 
being assessed, are economical in terms of cost and time, and reduce rater bias and subjectivity by 
using standardized presentation of questions. They also provide a means of evaluating frequency 
and severity of behaviors with age and gender specific norms to compare to and can be given to 
both parents and teachers (showing the problem in multiple settings required by DSM-IV). 
Normative samples include children form different socioeconomic and racial and ethnic 
populations, and are multifactorial (Lerner J., Lowenthal, & Lemer, S.1995). 

A major concern with rating scales is that they are thought to be gender biased. There have been 
numerous studies completed which show that girls differ from boys in the rate of ADHD (Arnold, 
1996; Gaub, & Carlson, 1997b). However, the DSM-IV does not take gender into account when 
putting out its criteria (Collett, Ohan, Myers, 2003). Unfortunately, this may mean that the DSM- 
IV may not be demonstrated as accurately in females as it is in males (Ohan & Johnston, 1999; 
Quinn & Nadeau, 2002). This issue leads to the subsequent misdiagnosis of females. 

Another valid concern with rating scales is that they were created for elementary age school 
children and did not account for children under age four or adults (Collett, Ohan, Myers, 2003; 
Hundhammer & McLaughlin, 2002). Barkley specifically makes note of this by saying: the 
appropriateness of the item set for developmental period (Barkley, 1998). This concern is 
specifically addressed in terms of adults who display impulsivity and some aspects of 
hyperactivity, but don’t quite meet the criteria for hyperactivity (Barkley, 1998). There has been 
very little research done in the area of adulthood ADHD and how the DSM-IV criterion conforms 
to college students in the United States (DuPaul, Schaughency, Weyandt, Tripp, Kiesner, Ota, & 
Stanish, 2001). Adults rely heavily on self-reported scales diagnosis with ADHD (Dulcan, 1997; 
Jackson & Larrugia, 1997). Heiligenstein et ah, (1998) performed a study using 448 university 
students that showed 4% of the participants had ADHD; this was found by using DSM-IV criteria. 
However, when Heiligenstein et al., performed a similar study with self-rating scales, the percent 
of people who qualified for ADHD rose up to 11% (Heiligenstein et al., 1998). Therefore, 
Heiligenstein et al., (1998) decided that rating scale criteria brought forth by the DSM-IV is too 
high when applied to universities. Various researchers believe that students will underscore 
themselves on the tests; as a result of academic underachievement associated with ADHD will not 
be represented on the test (Javorsky & Gussin, 1994; Richard, 1995). A few studies regarding 
ADHD children into adulthood have been completed; however, this is an area that needs more 
extensive research (Barkley, 1998). 

The final major concern with rating scales is its correspondence to different ethnic groups. In the 
past, it was believed that expression, course, and outcome of psychological disorders were largely 
universal and was not effected by ethnicity (Marsella & Kameoka, 1989). Now researchers say 
that cultural bias is an issue with ADHD testing (Reid, 1988). Thus, the creditability of behavior 
rating scales to culturally or ethnically diverse people has been questioned (Reid, Casat, Norton, 
Anastopoulos, & Temple, 2001). With the increase of minority groups, which in some states equal 
half of the student population, the need of a scale that has stability through ethnicity is becoming 
more a prevalent issue (American Council on Education, 1988; Quality Education for Minorities 
Project, 1990). These are the main concerns regarding ADHD rating scales. More problems are 
being discovered facing rating scales but ethnic inclusion, gender bias and early childhood and 
adult ratings, are three legitimate concerns. 

Barkley (1981) stated the following criteria for rating scales: the scales should be worded so that 
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The majority of adults can understand them, the scale should have enough items to assess 
constructs under the study but not so many that it is overly time consuming, and the answer format 
should have the degree of problem being endorsed instead of only yes or no answers. Lurthermore, 
the test should focus more on the construct being tested, it should have construct validity which 
tries to correlate the symptoms of ADHD with the questions being asked, and it should also be 
similar to other useful measures in the field in the manner of criteria and outcomes. The rating 
scale should also be reliable for all individuals tested, and two different raters should be provided 
for each child, and the scale norms should include a large age range. Although it is not specifically 
mentioned, the normative group should include people from different parts of the country, several 
of those with various socioeconomic backgrounds, a range of ethnicity, and a number of 
individuals from both rural/ city surroundings (Reid, Casat, Norton, Anastopoulos, & Temple, 
2001). Linally they should provide some information on planning interventions after the 
completion of the test, the majority of the scales do not include this aspect. (Barkley, 1981). 

The Conners ’ Teacher Rating Scale ( CTRS and CTRS-R) 

The Conner’s Teacher Rating Scale (CTRS) is one of the three most popular rating scales used by 
professionals today for teacher rating scales to diagnose ADHD (Barkley, 2003. The CTRS 
consists of either 39 or 28 questions and addresses four factors: conduct problem, hyperactivity, 
inattentive-passive, and hyperactivity index (Parker, 1992). Although the scale has been used for 
over thirty years, it retains its professional appeal and has remained stable (Gumpel, Wilson, & 
Shalev, 1998). This scale covers children from ages 3 to 17. The CTRS-R has been shown to be 
an adequate scale for preschooler’s both in middle to upper classes, and in both predominantly 
white schools and in lower socioeconomic mainly African American schools (Miller, Koplewicz, 
&Klein, 1997; Lantuzzo, Grim, Mordell, McDermott et ah, 2001). 

Some studies have shown that age had an effect on the percent of people diagnosed with ADHD; 
stating that children are diagnosed more then adolescence (Rowe & Rowe, 1997). Some also 
argue that the CTRS focuses only on the negative aspects of the child (Achenbach, 1991; 
Achenbach & Edelbrock, 1983; Atkins et al., 1985; Lambert et ah, 1990; O'Leary & Steen, 1982; 
Pelham et al., 1992; Quay & Peterson, 1983; Rutter, 1967; Shaywitz et al., 1986; Ullmann et al., 
1985a). This can create a bias with the tester and increase the searching for a pathology instead of 
accurately recording the given information regarding true diagnosis (Rowe & Rowe, 1992). 
Originally the CTRS was developed as way to screen and test behavior problems when children 
took medication, this means that the scale is sensitive to medication. The CTRS is useful with a 
dual function for assessing psychosocial and drug treatment problems with children that have 
behavior problems (Horn, lalongo, Popovich, & Peradotto, 1987; Lischer & Newby, 1991). 
CTRS does show some test-retest reliability (i.e., test a child twice to see if the same conclusions 
come from the data). This was shown by coming up with similar findings when retesting children 
(Glow, Glow, & Rump, 1982). One of the major problems with the CTRS is that the norming 
group is quite small and from one section of the country, and a new larger group needs be tested 
from all over the country so that the scale can be kept up to date ( Conners, Sitarenios, Parker, & 
Epstein, 1998) . 

The newest version of the CTRS has a greater focus on ADHD-related behavior and relevance 
regarding the questions asked. The CTRS-R (1998) has more discriminatory status toward 
detecting ADHD then the previous version (Conners, et al., 1998). An additional finding from the 
Conners at al., (1998) was the scale should not be used alone, but instead, other diagnostic tests 
should be employed. The CTRS however seems to detect ADHD in more African American 
children then in white children (Conners et al., 1998; Langsdorf et al., 1979; Waechter et al.. 
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1979). More in-depth research regarding race found that an antisocial factor is found more in 
African American males and a socially inferior factor is found more in white females. The one 
study on the difference between teacher bias and the theory of a races acting showed differently 
the presence of teacher bias when examining white and Asian children (Sonuga, Burke, Minocha, 
Taylor, & Sandberg, 1993.). The final consideration was that when using the CTRS, the rater 
should take racial factors into account and to not only use the CTRS as the only diagnostic tool for 
detecting ADHD ( Epstein, March Conners, Jackson, 1998). In conclusion, the CTRS is a very 
reliable and prestigious scale, but the tester must be aware of some of the shortcomings of the 
scale. Rating scales are, and will continue, to be used as major diagnostic tool for the detection of 
ADHD school children for the foreseeable future. The rating scale is a device that most people can 
easily use and generalize to various subjects. The rating sale is a very quick way to gather 
information from the most important sources. Lurthermore, rating scales show that the behaviors 
are occurring in more then one setting. In addition, the scales allow for parents to provide 
information about their child in a clear and succinct way. The rating scale does have it problems 
such as gender, race, and age bias; however, these are things that will eventually improve and 
change with further revisions. One also has to consider that a truly universal and inclusive scale is 
nearly impossible to create. Most importantly, the rating scale has become one of the most useful 
diagnostic tools for detecting ADHD. 
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